Scalable Compression and Replay of Communication Traces

نویسنده

  • Michael Noeth
چکیده

Characterizing the communication behavior of larges cale ap p lications is a d iffi cult and cos tly tas k d ue to cod e/s y s tem comp lex ity and their long ex ecution times . A n alternative to running actual cod es is to gather their communication traces and then rep lay them, w hich facilitates ap p lication tuning and future p rocurements . W hile p as t ap p roaches lack ed los s les s s calable trace collection, w e contribute an ap p roach that p rovid es ord ers of magnitud e s maller, if not near cons tant-s ize, communication traces regard les s of the number of nod es w hile p res erving s tructural information. W e introd uce intraand inter-nod e comp res s ion techniq ues of M P I events and p res ent res ults of our imp lementation for B lueG ene/L . G iven this novel cap ability , w e d is cus s its imp act on communication tuning and bey ond . T o the bes t of our k now led ge, s uch a concis e rep res entation of M P I traces in a s calable manner combined w ith d eterminis tic M P I call rep lay are w ithout any p reced ence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ScalaTrace: Scalable compression and replay of communication traces for high-performance computing

We contribute an approach that provides orders of magnitude smaller, if not near-constant size, communication traces regardless of the number of nodes while preserving structural information. We introduce intraand inter-node compression techniques of MPI events that are capable of extracting an application’s communication structure. We further present a replay mechanism for the traces generated...

متن کامل

ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale

Characterizing the communication behavior of large-scale applications is a difficult and costly task due to code/system complexity and their long execution times. An alternative to running actual codes is to gather their communication traces and then replay them, which facilitates application tuning and future procurements. While past approaches lacked lossless scalable trace collection, we con...

متن کامل

Scalable Detection of MPI-2 Remote Memory Access Inefficiency Patterns

Wait states in parallel applications can be identified by scanning event traces for characteristic patterns. In our earlier work we defined such inefficiency patterns for MPI-2 one-sided communication, although still based on a serial traceanalysis scheme with limited scalability. In this article we show how wait states in one-sided communications can be detected in a more scalable fashion by t...

متن کامل

Intelligent scalable image watermarking robust against progressive DWT-based compression using genetic algorithms

Image watermarking refers to the process of embedding an authentication message, called watermark, into the host image to uniquely identify the ownership. In this paper a novel, intelligent, scalable, robust wavelet-based watermarking approach is proposed. The proposed approach employs a genetic algorithm to find nearly optimal positions to insert watermark. The embedding positions coded as chr...

متن کامل

Summarized Trace Indexing and Querying for Scalable Back-in-Time Debugging

Back-in-time debuggers offer an interactive exploration interface to execution traces. However, maintaining a good level of interactivity with large execution traces is challenging. Current approaches either maintain execution traces in memory, which limits scalability, or perform exhaustive on-disk indexing, which is not efficient enough. We present a novel scalable disk-based approach that su...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007